Inventi Impact: Information Security

Articles

Inventi:eis/20594/16

Feature Selection for Intrusion Detection Using Random Forest

Research 2016 : October - December

Md Al Mehedi Hasan, Mohammed Nasser, Shamim Ahmad, Khademul Islam Molla

An intrusion detection system collects and analyzes information from different areas within a computer or a network to identify possible security threats, including threats from both outside and inside the organization. It deals with a large amount of data, which contains various irrelevant and redundant features that increase processing time and lower the detection rate. Therefore, feature selection should be treated as an indispensable pre-processing step to improve overall system performance significantly when mining huge datasets. In this paper, we focus on a two-step approach to feature selection based on Random Forest. The first step selects the features with higher variable importance scores and guides the initialization of the search process for the second step, which outputs the final feature subset for classification and interpretation. The effectiveness of this algorithm is demonstrated on the KDD'99 intrusion detection datasets, which are based on the DARPA 98 dataset and provide labeled data for researchers working in the field of intrusion detection. An important deficiency of the KDD'99 data set is its huge number of redundant records, as observed earlier. Therefore, we have derived a data set, RRE-KDD, by eliminating redundant records from the KDD'99 train and test datasets, so that the classifiers and the feature selection method will not be biased towards more frequent records. RRE-KDD consists of the KDD99Train+ and KDD99Test+ datasets for training and testing purposes, respectively. The experimental results show that the proposed Random Forest based approach can select the most important and relevant features for classification, which in turn not only reduces the number of input features and the processing time but also increases the classification accuracy.
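The abstract does not give implementation details, so the following is only a minimal sketch of the ideas it names: removing redundant records, ranking features by an importance score, and a second search step seeded by that ranking. All function names, the toy importance values, and `toy_evaluate` are hypothetical and are not taken from the paper. In practice the importance scores would presumably come from a trained Random Forest, and the evaluation function would be a classifier's validation accuracy. The greedy forward search shown here is one plausible reading of "search process", not necessarily the authors' method.

```python
from typing import Callable, Dict, Hashable, List, Sequence, Tuple

Record = Tuple[Hashable, ...]


def remove_redundant_records(records: Sequence[Record]) -> List[Record]:
    """Drop exact duplicate records, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for rec in records:
        if rec not in seen:
            seen.add(rec)
            unique.append(rec)
    return unique


def rank_by_importance(importance: Dict[str, float], top_k: int) -> List[str]:
    """Step 1 (assumed form): keep the top_k features by importance score."""
    ranked = sorted(importance, key=importance.get, reverse=True)
    return ranked[:top_k]


def forward_search(
    candidates: Sequence[str],
    evaluate: Callable[[List[str]], float],
) -> Tuple[List[str], float]:
    """Step 2 (assumed form): visit candidates in ranked order and keep a
    feature only if adding it improves the evaluation score."""
    selected: List[str] = []
    best = float("-inf")
    for feat in candidates:
        score = evaluate(selected + [feat])
        if score > best:
            selected.append(feat)
            best = score
    return selected, best


# Toy importance scores (made up for illustration, not results from the paper).
toy_importance = {
    "src_bytes": 0.40,
    "count": 0.25,
    "service": 0.20,
    "duration": 0.10,
    "land": 0.05,
}


def toy_evaluate(subset: List[str]) -> float:
    """Hypothetical stand-in for validation accuracy: rewards two 'useful'
    features and slightly penalizes subset size."""
    useful = {"src_bytes", "service"}
    return len(useful.intersection(subset)) - 0.01 * len(subset)


candidates = rank_by_importance(toy_importance, top_k=4)
selected, score = forward_search(candidates, toy_evaluate)
```

Seeding the search with the importance ranking means the second step evaluates only a handful of subsets rather than all possible combinations, which is the practical benefit of a two-step design on a dataset with many features.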

How to Cite this Article
CC Compliant Citation: Hasan, M.A.M., Nasser, M., Ahmad, S. and Molla, K.I. (2016) Feature Selection for Intrusion Detection Using Random Forest. Journal of Information Security, 7, 129-140. http://dx.doi.org/10.4236/jis.2016.73009, https://creativecommons.org/licenses/by/4.0/.